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Abstract. We study the distribution of the outliers in the spectrum of fi- 
nite rank deformations of Wigner random matrices under the assumption that 
the absolute values of the off-diagonal matrix entries have uniformly bounded 
fifth moment and the absolute values of the diagonal entries have uniformly 
bounded third moment. Using our recent results on the fluctuation of resol- 
vent entries I31| . |28| and ideas from [9], we extend the results by Capitaine, 
Donati-Martin, and Feral fl2l . fl3l. 



1. Introduction and Formulation of Main Results 

Let Xn = ~jjjWn be a random real symmetric (Hermitian) Wigner matrix with 
independent entries up from the diagonal. In the real symmetric case, we assume 
that the off-diagonal entries 

(W N )ij, l<i<j<N, (1.1) 
are independent random variables such that 

E(W N )ij = 0, Y(W N )ij = a 2 , m 5 := sup E|(Wiv)y| 8 < oo, (1.2) 

where E£ denotes the mathematical expectation and V£ the variance of a random 
variable £. The diagonal entries 

(W N ) U , l<i<N, (1.3) 
are independent random variables (that are also independent from the off-diagonal 



entries), such that 



E{W N ) U = 0, c 3 := supE|(WV)«r < °°- (1-4) 

i,N 



In a similar fashion, in the Hermitian case, we assume that the off-diagonal 
entries 

^(WN^j^miW^ij, 1 < i < j < N, (1.5) 
are independent random variables such that 

2 

E(W N )ij - 0, Y[<nt(W N ) iS ] = V[3m(W N ) l3 ] = 2- m 5 := sup E|(Wjv)y| 5 < oo. 

(1.6) 

The diagonal entries 

(W N ) U , l<t<N, (1.7) 
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are independent centered random variables, independent from the off-diagonal en- 
tries, with uniformly bounded third moment of the absolute values. 

For a real symmetric (Hermitian) matrix M of order N, its empirical distribution 

of the eigenvalues is defined as /zm = 37 Y^iLi fan wnere A i > • • • > <Viv are the 
(ordered) eigenvalues of A/. Wigner semicircle law (see e.g. [7J, [I], [2]) states that 
almost surely the empirical distribution fix N of a random real symmetric (Hermit- 
ian) Wigner matrix Xjy converges weakly to the nonrandom limiting distribution 
fi sc - The limiting distribution /i sc is known as the semicircle distribution. It is 
absolutely continuous with respect to the Lebesgue measure and has the compact 
support [— 2<r, 2a]. The density of the Wigner semicircle distribution is given by 

^p(x) = ^V^-s'lt-W*)- ( L8 ) 

Wigner semicircle law can be reformulated as follows. For any bounded continuous 
test function <p : R — > R, the linear statistic 

1 - 1 

-^^(A 4 ) = -Tr^^)) =: tr N (^(l N )) 
i=l 

converges to J tp(x)dfj, sc (dx) almost surely; here and throughout the paper, we use 
the notation tr^ = -j^Tr to denote the normalized trace. 
The Stieltjes transform of the semicircle law is 



*,(*) := / = Z "f7^ , * € C\[-2aM. (1.9) 

7 z — x 2cH 

It is the solution to 

a 2 ^(z)-z 5CT (z) + l = (1.10) 

that decays to as \z\ — > 00. 

In this paper, we study the fluctuations of the outliers in the spectrum of finite- 
dimensional deformations of Wigner matrices. Starting with the pioneering work by 
Fiiredi and Komlos [20 , there have been several results on finite rank perturbations 
of matrices with i.i.d. entries, in particular [3D], [IS], [2S], [12], Q3], [H], [H], 0, 
[ID] . [35] . We also note several papers on the eigenvalues of sample covariancc 
matrices of spiked population models ([3j, [5], [6], [29]). 

This manuscript can be viewed as a companion paper to our recent works [31] 
and |28j on the non-Gaussian fluctuation of the matrix entries of regular functions 
of Wigner matrices. However, no knowledge of the machinery used in |31j and |28j 
is required, and the paper can be read independently from these papers. 

Let us consider a deformed Wigner matrix 

M N := -^=W N + A N = X N + A N . (1.11) 
V N 



Here Wn is a random real symmetric (Hermitian) Wigner matrix as defined in ([Lit - 
II. 4j) ( (|1.5lll.7l ). an d An is a deterministic Hermitian matrix of fixed finite rank r. 
We assume that the eigenvalues of An and their multiplicities are fixed. Let 

0i > . . . > e jo = > . . . > 9 j 

be the eigenvalues of A n each with fixed multiplicity kj. Clearly, the eigenvalue 
9j a = has multiplicity N — r and 2j=tj % = r - 
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The first theorem of this section, Theorem 11.11 concerns the convergence of the 

2 

extreme eigenvalues of the deformed random matrix. Let us denote po = 9 + ^- . We 
shall use the shorthand notation pj for pg j . Theorem 11.11 was originally proved by 
Capitaine, Donati-Martin, and Feral in |12j in the case when the common marginal 
distribution of the matrix entries is symmetric and satisfies a Poincare inequality. 

Theorem 1.1. LetX^s — ~^ n *Wn be a random real symmetric (Hermitian) Wigner 
matrix satisfying h Q]| l.J$ (respectively lll.5\\1.7\ l). Let J a + be the number ofj's 
such that 9j > a and J a - be the number of j 's such that 8j < —a. 

(a) For all j = 1, . . . , J a + and i = l,...,kj, Ajt 1+ ...+fc 3 -_ 1+ j p j: 

(b) Afc 1+ ... + fc^ ++ i -> 2a, 

( c ) ^k x +-+kj-j^_ -2a, 

(d) For all j = J — J a - + 1, . . . , J and i = l,...,kj, Xk 1 +...+k j - 1 +i pj. 
The convergence is in probability. 

In other words, the first k\ largest eigenvalues of Mn converge to pi, the next &2 
largest eigenvalues converge to p2 , • ■ • , the J a + th bunch of the largest eigenvalues 
converge to pj + , the next largest eigenvalue converges to 2a (since it corresponds 
to a nonnegative eigenvalue of An which is not bigger than a), etc. 

Remark 1.1. // random variables {Wn)^, 1 < i < j < N, satisfy a Poincare 
inequality U.12\) with constant fj,j,jv uniformly bounded from zero, Vi t j j$ > v > 0, 
the convergence holds with probability one. 

We recall that a probability measure P on K M satisfies a Poincare inequality 
with constant v > if, for all continuously differentiable functions / : R M — > C, 

V P (/) = E P (|/(a0 - E P (f(x))\ 2 ) < -E P [\Wf(x)\ 2 } (1.12) 

v 

Note that the Poincare inequality tensorizes and the probability measures satisfying 
the Poincare inequality have subexponential tails f |21j. pQ) . In particular, if the 
marginal distributions of the matrix entries of Wjv satisfy the Poincare inequality 
with constant v > 0, then the joint distribution of (Wu)ij, 1 < i < j < N, also 
satisfies the Poincare inequality with the same constant v. By a standard scaling 
argument, we note that if the marginal distributions of the matrix entries of Wn 
satisfy the Poincare inequality with v > then the marginal distributions of the 
matrix entries of Xn — TT^^V satisfy the Poincare inequality with constant Nv. 

Theorem 11.11 follow from Theorem 11.21 formulated below. Theorem 11.21 is con- 
cerned with the distribution of the outliers, i.e. the eigenvalues of Mm correspond- 
ing to 9j > a. Namely, we are interested in the fluctuation of the outliers around 
Pj, 1 < j ' < J a +- Let us consider a fixed eigenvalue 9j of An such that 6j > a. In 
general, if one does not assume some additional information about the structure of 
the eigenvectors of A jv corresponding to 6j , the sequence of random vectors 

(yN(\ kl+ ... +k] _ 1+l - P:j ), i = l,...,fcj) (1.13) 

does not converge in distribution as N — J- oo (see Theorems 11.31 and 11.51 below) . 
However, one can prove that the sequence (|1.13[) is bounded in probability (tight). 
We recall that a sequence {£at}at>i of R m -dimensional random vectors is bounded 
in probability if for any e > there exists L(e) that does not depend on N such 
that P(|&vi > L{e)) < e for all N > 1. 
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Theorem 1.2. LetXw = T^^Ov be a random real symmetric (Hermitian) Wigner 
matrix defined in hl.V\ 1.1$ (respectively \1.5\l.lty ). Let 1 < j < J CT +, so £/ie eigen- 
value Oj of An satisfies 9j > a. Then the sequence of random vectors 



+...+k^ 1+ t - pj), i = l,...,kj) (1.14) 



is bounded in probability. In addition, if the marginal distributions of the matrix 
entries of Wn satisfy the Poincare inequality il.lUft) with constant Wt,j',JV uniformly 
bounded from zero, the following holds with probability 1 

/logiV\ 

- Pj ; = O — -==- , i = 1, . . . , kj. (1.15) 



TV . 

Remark 1.2. Theorem MM clearly implies parts (a) and (d) of Theorem \1. 11 To 
see that parts (b ) and (c) of Theorem \1.1\ also follow, we note that for any fixed 
positive integer I > 1 the l-th largest eigenvalue of Xn converges in probability to 
2a. This is a simple consequence of the convergence of the largest eigenvalue of Xn 
to 2a and the semicircle law. Then the interlacing property and Theorem \1.S\ imvlu 
the desired result. 

Remark 1.3. The bound \1.15\) means that there exists a sufficiently large deter- 
ministic constant C = C(a,v,9i, . . . , 9 r ) > 0, such that with probability 1 

, ^ ClogiV . 
\Mi+...+k i -i+i - Pj\ < y= , i = l,...,%, 

for all but finitely many N. 

To study the fluctuations of the outliers in more detail, we consider two special 
cases following [j"3"] . 

Case A ( "The eigenvectors don't spread out" ) 

The orthonormal eigenvectors of Am corresponding to 9j depend on a finite num- 
ber Kj of canonical basis vectors of (without loss of generality we can assume 
those canonical vectors to be ex, ... , e^,), and their coordinates are independent of 
N. 

Case B ( "The eigenvectors are delocalized" ) 

The l°° norm of every orthonormal eigenvector of An corresponding to 9j goes 
to zero as N —> oo. 

Following |13) . we denote by k a + = ki + . . . + kj + the number of positive 
eigenvalues of A n bigger than a (counting with multiplicities) and by k > k a + the 
minimal number of canonical basis vectors ei, . . . , ejy of C N required to span all 
the eigenvectors corresponding to the eigenvalues 9i,. .. ,9j + . 

We also denote 

% 02^2" (1-16) 

The next theorem is a consequence of Proposition 11.11 below and Theorems 1.1 and 
1.5 in [31] , We use a standard notation (3 = 1 in the real symmetric case and /3 = 2 
in the Hermitian case. 

Theorem 1.3. LetX^ = "^y^v be a random real symmetric (Hermitian) Wigner 
matrix defined in d 1.1\\ l.J$ (respectively \1.5\1.7^ ) such that the off -diagonal entries 
{Ww)ij> 1 < i < j < N, are i.i.d. real (complex) random variables with probability 
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distribution fi and the diagonal entries (Wn)u, I < i < N, are i.i.d. random 
variables with probability distribution In Case A, the kj -dimensional vector 



c 



v / iV(Afc 1+ ... +fcj _ 1+1 - pj), i = l,...,h 



converges in distribution to the distribution of the ordered eigenvalues of the kj x kj 
random matrix Vj defined as 

Vj-UJiWj+H^Uj, (1.17) 

where 

(i) Wj is a Wigner random matrix of size Kj with the same marginal distribution 
of the matrix entries as Wn , 

(ii) Hj is a real symmetric (Hermitian) Gaussian matrix of size Kj, independent 
of Wj , with centered independent entries H st , 1 < s < t < Kj, (*Ke H st , 3m H st , 1 < 
s < t < Kj, H pp , 1 < p < Kj in the Hermitian case) with the variance of the entries 
given by 



•'' =^-|gp^ ; s = X,...,K 3 , (1.18) 



E(iJ^) = ^2 97 1 < s < t < Kj, in the real symmetric case, (1-19) 

4 

E((fReiJ s4 ) 2 ) = E((3mH st ) 2 ) = 1 < s < t < K d , in the Hermitian cast 

' j _ G i 

(1.20) 

where 

K4 (/i) := / Vfd^x) - (4 - [3){ [ \x\ 2 dpi{x)) 2 , (1.21) 



is the fourth cumulant of fi, and 

(Hi) Uj is a Kj x kj such that the (Kj- dimensional) columns of Uj are written 
from the first Kj coordinates of the orthonormal eigenvectors corresponding to 6j . 

In [13] . Theorem 11.31 was proved for symmetric marginal distribution satisfying 
the Poincare inequality (|1.12p under an additional technical assumption that k — 
o(^/N), where k is defined in the paragraph above (|1.16|) . 

Using Theorems 4.1 and 4.2 from [28j, one can extend the results of Theorem 
11.31 to the case when the entries of Wn are not identically distributed provided the 
distribution of the entries (Wn)u, 1 < i,l < Kj does not depend on N. 

Theorem 1.4. LetXj^ = -j=Wn be a random real symmetric (Hermitian) Wigner 
matrix defined in 11. l^l.JJ) ( respectively i U.5l|i.7| ) ) such that the distribution of the 
entries (Wn)u, 1 < i,l < Kj does not depend on N. Let us assume that the limits 

m 4 (i):= lim 1 £ EK^)^ 4 (1.22) 

exist for 1 < i < Kj . 

Then in case A, the results of Theorem \1.3\ hold with Kn(fi) in (1.18\) replaced by 

Ki(s) :=m 4 (s) - (A-f3)a 2 , s = l,...,Kj. (1.23) 

The next theorem deals with the Case B. 
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Theorem 1.5. LetXw = ^T^Wn be a random real symmetric (Hermitian) Wigner 
matrix defined in fHH| l-4\ l (respectively U.5\\1.7\ l) such that the off-diagonal entries 
(WN)ij, 1 < i < j < N, are i.i.d. random variables with probability distribution 
H and the diagonal entries {Wn)h, 1 < i < N, are i.i.d. random variables with 
probability distribution fix. In Case B, the kj -dimensional vector 

(ce J VN(X kl +...+k^ 1 + l - Pj), i = l,...,kjj 
converges in distribution to the distribution of the (ordered) eigenvalues of a kj x kj 

8 2 <r 2 

GOE (GUE) matrix with the variance of the matrix entries given by 6 ^_ a i provided 
k = o(VN). 

Remark 1.4. We recall that k has been defined above as the minimal number of 
canonical basis vectors e\, . . . r/v required to span the eigenvectors corresponding to 
the eigenvalues 9±, . . .6j + . 

Thcorcm ll.5l is an immediate extension of the result of Capitaine, Donati-Martin, 
and Feral from |13j to our setting since their arguments apply essentially unchanged 
as soon as Theorem 1 1.1 1 is established. 

It should be noted that Benaych-Georges, Guionnet, and Maida consider in [5] 
perturbations of a random Wigner matrix by a finite rank random matrix with 
eigenvectors that arc cither independent copies of a random vector v with i.i.d. 
centered components satisfying the log-Sobolev inequality or are obtained by Gram- 
Schmidt orthonormalization of such independent copies. The distribution of the 
outliers is given in Proposition 5.3. of [5]. Let us denote the distribution of the 
first component of v by v. If the fourth cumulant n^(y) of v vanishes, the limiting 
distribution of the outliers is similar to the result of Theorem 11.51 and given by 
the distribution of the ordered eigenvalues of a GOE (GUE) matrix. If the fourth 
cumulant docs not vanish, one has to add a diagonal matrix with i.i.d. real Gaussian 
entries to a GOE (GUE) matrix. 

One of the most important results of j9], |10j concerns the distribution of the 
"sticking" eigenvalues (i.e. the eigenvalues that correspond to \9j\ < a). In The- 
orem 5.3 of [5], Benaych-Georges, Guionnet, and Maida prove that their limiting 
distribution is given by the Tracy- Widom law. 

Let us briefly describe a key ingredient of the proofs of Theorems 11.2111.41 We 
use the notation 

R N (z) := (zI N - Xn)' 1 , zeC\ [-2ct, 2a], (1.24) 

for the resolvent of Xn- Clearly, Rn(z) is well defined for z G C \ E. Since the 
spectral norm of Xn converges to 2a in probability (see e.g. [4], and Proposition 
2.1 in [28]), Rn(x) is well defined for a fixed x € R\ [— 2a, 2a] with probability 
going to one. Since our results will deal with the limiting distribution of random 
variables y/~N ((u, Rn(x)v) — g a (x)(u, v)) in the limit N — > oo, this should not lead 
to ambiguity. 

Let us consider a fixed eigenvalue 9j of An such that 6j > a and denote by 
, . . . , the orthonormal eigenvectors of An that correspond to the eigenvalue 
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9j . Denote by S]^ the kj x kj matrix with the entries 

3« :=VN((v«\R N (j> j )v®)-g a (p j )8 a )=VN^ , 

3 (1.25) 

2 

where we recall that pj = 9j + . The following proposition plays an important 
part in our proofs. 

Proposition 1.1. Let yi > . . . > yu, be the ordered eigenvalues of the matrix H^. 

ViV(A fcl+ ... +fcj _ 1+l -p,) + ^- 2 /. i ^0, i = l,...,^-, (1.26) 

9 a KPj I 

in probability. 

Remark 1.5. A simple computation gives 

-_L-=0f -a 2 . (1.27) 

9a [Pj) 

It should be mentioned that the key part of the proof of Proposition 11.11 is a 
lemma from [9, which is stated as Lemma B~2"l in Section^ Proposition ll.ll indicates 
that the question of the limiting distribution of the outliers of the spectrum of the 
deformed Wigner matrix Mpf can be reduced to the question about the limiting 
distribution of the entries of (|1.25[) . 

Let us denote by (u, v) — UiVi the standard Euclidean scalar product in C N . 
The next theorem deals with the values of the sesquilinear form (u^ N \ f (X n)v^ n ^} 
where / is a sufficiently nice test function on K and u^ N \ 6 C N are nonrandom 
unit vectors in C*, i.e. 



N 

\u\\ 2 = (u,u) = ^2 H 2 - 

1 



Without additional assumptions on vS N > and v^ N \ the sequence 



does not necessarily converge in distribution. However, one can show that it is 
tight. 

We say that a function / : / C R — > R belongs to C"(7) if / and its first n 
derivatives are continuous on /. Define 

dx l 



l/llc»(i) == max I-7-7OOI, xe I, 0<l <n 



We use the notation C™(R) for the space of n times continuously diffcrcntiable 
functions on R with compact support. Define 

/>oo 

||/||„,i := max( / \d k f / dx k (x)\dx , < k < n), (1.28) 



|/||„.i. + :=i»ax( / (\x\ + l)\il(x)\dx, 0<i<n). ( 1 .29 ) 
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We recall that a function / : R — > R is called Lipschitz continuous on an interval 
/ C R if there exists a constant C such that 

\f(x)-f(y)\<C\x-y\, for all x, ye I. (1.30) 

We define 

\f\c,R = sup ; _ , (1.31) 

x^y \x y\ 

and 

ifi „, m l/fa)-/(y)l n o 9 n 

x^ty, x,y£[-2a-8,2a+8] F — 2/| 

Theorem 1.6. LetXjq = be a random real symmetric (Hermitian) Wigner 

matrix defined in kl.l§1.4\ ) (respectively il.5\\1.7\ )). Then the following statements 
hold: 

(i) If f is a C 5 (R) function such that \\f\\ 5< i is finite, and u^, e 

C , iV > 1, are two nonrandom sequences of unit vectors (in standard Euclidean 
norm), then there exists a constant C'onst(a 2 ,7715,03) such that 

v((uW,f(X N )vW)) <ConstMhl. 



In particular, the sequence VN {{u {N \ f{X N )v {N) ) - E(u {N \ f(X N )v^)) is bounded 
in probability. 

(ii) If f £ C^(R), with supp(f) C [— L,+L], where L is some positive number 
then there exists a constant Const(L, a 2 , 777,5, C3) such that 

\E(u( N \f(X N )vW) - (^l^)) f(x)dn sc {dx)\ (1.34) 

J -2a 

< Const(L,<r 2 ,m 5 ,c 3 )\\f\\ C 8 {[ _ L +L]) -^=. 

V TV 

//, ?77 addition, f e C 9 (R) and ||/||g,i.+ is finite, then 



\E(u( N \f(X N )vW)-(uW,vW) [ f(x)d» sc (dx)\ < Const(a 2 ,m 5 ,c 3 )\\f\\ g . 1 . + ^=, 

J -2a \N 

(1.35) 

where Const(a 2 ,7775,03) depends on a 2 ,7775, and c 3 . 

(Hi) If the marginal distributions of the entries of Wn satisfy the Poincare in- 
equality hi. 12) with a uniform constant v > 0, and f is a lipschitz continuous 
function on [—2a — 8,2a + 5] that satisfies a subexponential growth condition 

\f{x)\ < aexp(b\x\) forallxeR, (1.36) 

for some positive constants a and b, then 

p(\(u^ N \f(X N )v^)-E(u^ N \f(X N )v^)\>t) (1.37) 

- 2Kexp ("^t£) +(2i " +0(1))exp {-^r s ) ' 

where \f\c.s is defined in 11.32) , 

A' = -^2 ,; log(l-2- 1 4- 4 ), (1.38) 

i>0 
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and v is the constant in the Poincare inequality \1.12\i . 

(iv) If the marginal distributions of the entries of Wn satisfy the Poincare in- 
equality il.lty) with a uniform constant v > 0, and f is a Lipschitz continuous 
function on R, then 

P f(X N )vW) - E(u^ N \f(X N )v^)\ > t) (1.39) 

/ VvNt \ 

where \f\c,R is defined in \1.31) ). 

(v) If the marginal distributions of the entries of Wn satisfy the Poincare in- 
equality H1.1S\) with a uniform constant v > 0, f £ C 8 (R), and f satisfies the 
subexponential growth condition H1.36\) , then 

E(uWj(X N )vW) = (u( N \ V W) J° f(x)dfi sc (dx) + o(-^Jj . (1.40) 

We finish this section by formulating our last theorem, Theorem 11.71 which 
allows us to extend Theorem 11.31 (see Remark 15.11 in Section [5]) . Assume that that 
the off-diagonal entries (WN)ij, 1 < i < j < N, are i.i.d. random variables with 
probability distribution fi and the diagonal entries (Wn)u, 1 < i < N, are i.i.d. 
random variables with probability distribution 

Let us consider u^ N \ £ C N that are independent of N for all N > No, in a 
sense that only a fixed finite number of the coordinates of u^ N \ are non-zero 

and the coordinates do not change with N for TV > Nq. In this case, we can write 
= u, «W = v, with the understanding that as the dimension N grows, one 
just adds more zero coordinates to u and v. As an immediate consequence of the 
results of Theorem 1.1 (real symmetric case) and Theorem 1.5 (Hermitian case) in 
[31j . the random sequence 



N((u,R N {z)v)- gir (z){u,v)) (1.41) 

converges in distribution as N — > oo. Without loss of generality, we will consider 
the real symmetric case; the Hermitian case is essentially identical. Let m be an 
arbitrary fixed positive integer. Denote by PS m >{z) the m X m upper- left corner of 
the matrix Rn(z). Theorem 1.1 in |31) states that a matrix- valued random field 

T N (z) = (rt m \z) - g a (z)I m ^J , z €C\ [-2a,2<7], (1.42) 

with values in the space of complex symmetric m x m matrices, converges in finite- 
dimensional distributions to a random field 

T(z) = gl(z)(W^+Y(z)), (1.43) 

where W^ m ^ is the mxm upper-left corner submatrix of a Wigner matrix Wn, ga(z) 
is the Stieltjes transform (|1.9[) of the Wigner semicircle law, and 

Y(z) = (Y ZJ (z)),Y t3 (z) = Y j{ (z), 1 < i,j < m, 

is a Gaussian random field with the covariance matrix given by the formulas (1.18)- 
(1.23) in the real-symmetric case and (1.50)-(1.55) in the Hermitian case in [3T]. It 
is important to note that Yij(z), 1 < i < j < to, are independent random processes 
for different indices (ij). 
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Let us extend the definition of T(z) to that of an infinite-dimensional matrix 
T(z) pq , 1 < p, q < oo, using the formulas (1.18)-(1.23) (respectively (1.50)-(1.55)) 
from [31] . Thus, the r.h.s. in (jl.43l) defines now the m x m upper- left corner of the 
infinite matrix T(z). Then Theorem 1.1 of |31j implies that 

y/N((u,R N (z)v) - g a {z)(u,v)) -> (ii,T(z)ti), (1.44) 

in distribution. 

Let u,v £ l 2 (N). It follows from the Kolmogorov three-series theorem (see e.g. 
[17] ) that (u, T(9j)v) is well defined as an infinite sum of centered random variables 
with summable variances. For our analysis of the outliers in the spectrum of finite- 
rank deformations of Wigner matrices, it will be useful to have the following result. 

Theorem 1.7. LetXw — ZJ^^N be a random real symmetric (Hermitian) Wigner 
matrix defined in d l.J$ (respectively hl.&\1.7(j ) such that that the off-diagonal 
entries (Wjv)ij) 1 < i < j < N, are i.i.d. random variables with probability distri- 
bution pL and the diagonal entries (Wn)u, 1 < i < N, are i.i.d. random variables 
with probability distribution fi\. 

Let I be a fixed positive integer, iti, . . . ,ui, be a collection of non-random vectors 
in l 2 (N), and let Up , 1 < p < I, denote the projection of u p to the subspace 
spanned by the first N standard basis vectors ei, . . . , ejv- Then the joint distribution 
of 

Vn((u< p n \R n (z)uW) -g a (z) , l<p,q<l, 

converges weakly to the joint distribution of (u p , T(z)u q ), 1 < p, q < /. 

The rest of the paper is organized as follows. Section[2]is devoted to the estimates 
on the mathematical expectation and the variance of the values of the resolvent 
sesquilinear form (u^ N \ R^(z)v^ N ' > ) , where u^ N \ v^ N ' are arbitrary non-random 
unit vectors in C . Using the estimates obtained in Section [21 we prove Theorems 
11.61 in Section [31 Theorem 11.21 is proved in Section 2) Finally, Theorems 11.31 11.41 
and 1 1.71 are proved in Section [5] In the Appendix, we discuss tools used throughout 
the paper. 

We would like to thank A. Guionnet for bringing our attention to the preprints 
[9] and [ID]. 

2. Mathematical Expectation and Variance of Resolvent Sesquilinear 

Form 

This section is devoted to the proof of the main building block Theorem 11.61 
namely Proposition ^. II 

Without loss of generality, we can restrict our attention to the real symmetric 
case. Let = (ui, . . . , ujv), = (vi, . . . , vn) be nonrandom unit vectors in 

C N . When it does not lead to ambiguity, we will omit the superscript in and 
Define 

m := ( U ( N \R N (z)vW) = J^HiRijVj. (2.1) 

ij 

When it does not lead to ambiguity we will use the shorthand notation, Rij, for 
the ij-th entry (Rpj{z))ij, of the resolvent matrix Rj^{z). 

Proposition 2.1. Let Xn = -^=Wn be a random real symmetric (Hermitian) 
Wigner matrix defined in U.lVjl.4\ ) ( ^1.5\1.7\j ), R N {z) = (zLn-Xn)' 1 , z £ C\M, 
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and w- N 1 — (ui, . . . , mat), — (vi, . . . , vn) be nonrandom unit vectors in <C N . 

Then 

E m = E( U W R N (z)v^) = g a (z)( U W,vW) + O ( ] ) , (2.2) 

\| 3mz\WN / 

uniformly on bounded subsets of C \ R, 

E(uW,R N (z)vW) = 3 ,(.)(^),«W) +0 ((\z\ + M) Ps(l ^y l ' 1) ) , (2.3) 

Y m =Y(u^,R N (z)v (N) ) = O ( Ps(l y l) ) , (2.4) 

uniformly on C \ R, where Pi(x), I > 1, denotes a polynomial of degree I with fixed 
positive coefficients, and M is some constant. 

Remark 2.1. In i/ie case when u^ N ' and are standard basis vectors, u = 

e^, t> = £j, the mathematical expectation and the variance of (u^ N \ Rn(z)v^ n ^) 
have been studied in [31j . In particular, it has been shown there in Proposition 2. 1 
and (3.27) that 

ER v =g tf (z) + 0( J5 ^ N ) t (2.5) 
uniformly on bounded subsets of C \ R, and 

ER H =g a {z) + oUz\+M) P7{rJ ™ A 1] \ , (2.6) 



ERij = o(^^),^ = o(M^l ); = , ,,,, 



^ = 0[ P6irJ Z Zn '), l<iJ<N, (2.8) 



N 



uniformly on C \ . 



Remark 2.2. In [15] , Erdds, Yau, and Yin studied generalized Wigner matrices 
(defined at the beginning of Section 2 of [18j ). and obtained the following estimates 
provided the marginal distributions have subexponential tails 

¥ S^max^z) - 9r7 (z)\ > - J^L_| < Cexp [-c(logiV)^] , (2.9) 

F^max^lRijiz)] > j^'j < Ccxp [-c(logiV)^] , (2.10) 

where < <f> <1, C > 1, c > are some constants, 4/4> < I < C log AT/ log log AT, 
Af-^log A0 loi <3mz< 10, |£Hez| < 5a, and N is sufficiently large. 



Remark 2.3. R follows from our proofs that the error term on the r.h.s. of i2. 2\) 

can be replaced by O ( m "p l 1 ^i 1 |7^ lll) ) , where \\u\\i = J2iLi \ u i\- 

The rest of the section is devoted to the proof of Proposition 12.11 

Proof. Without loss of generality, we can restrict our attention to the real symmetric 
case. The proof in the Hcrmitian case is very similar. We start by proving 
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Using (zIn — Xn)Rn{z) — In, we write 
zEy^^UjRjjVj = EyVj (Sij + X ik R kj )vj = (u, v ) + ^ UiVjE(X ik R k] ). (2.11) 

i j i j ij k 

Applying the decoupling formula (|6.1[) and (16. 4116. 5p to the term E{X ik R k j) in 
(|2.1ip . we obtain 

2 

zE77jv = (u, v) + a 2 E (r/Artrwi?) + ^E ((tt, (i?^)) 2 ")) (2.12) 
, ^ V[(Wv)«]-2<7 2 _ 

+ > t jz UiVjE(RiiRij) +rjv, (2-13) 

where 77 at is defined in (|2.ip . and r^r contains the third and the fourth cumulant 
terms corresponding to p — 2 and p = 3 in the decoupling formula (|6.1I) for i 7^ fc, 
and the error terms due to the truncation of the decoupling formula (|6.ip for i =/= k 
at p = 3 and for i = k at p = 1 . 
It follows from 

|V[(WV)ii] -2fT 2 | <consi(a 2 ,C3), 

that the first term in (|2.13p can be written as the mathematical expectation of i x 
(a, Rn(z)v), where the vector a has coordinates (V[(Wjv)ii] — 2<r 2 )RaUi, 1 < i < N. 
Using (|6.7p . one obtains by estimating ||a|| from above that 

i,j 

The third cumulant terms {p = 2) give 

2!_/v3/ 2 ^ ^ K 3 (i, k)uiRij RikRkkVj) + 2E( ^ K 3 (i,k)uiRuR kk R k jVj) 

i,j,k:i^k i,j,k:ij^k 

+ 2E( ^ Kaii^^iRkiRkiRkjVj)}, (2.15) 

where by ^3(1, fc) we denote the third cumulant of (Wjv)jfc. We note that 

|/C3(z, fe)| < Const{m^), 

uniformly in i 7^ k and N. To estimate the absolute value of the first term in (|2.15l) , 
we first sum with respect to j and then use the Cauchy-Schwarz inequality and 
(|577|) to obtain 

i 7 j,k:i^k i^k 



<Const(m 5 )E ( |i4fc| 2 ^ Nll^,;|| |(i?^l ) < ^T^^ ' ( 2 - 16 ) 
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To estimate the absolute value of the second term in (|2.15|) , we write 
J~] K 3 (i,k)uiRuRkkRkjVj\ = K 3 (i,k)uiRiiRkk(Rv)k\ 



\E 

i,j,k:i^£k i^k 



< 



Const(m 5 )E ^ |^|||i?|| 2 |(i? V ) fc |j < Const(m 5 )VN 1 1 1 v 1 1 E| | i? w (^) 1 1 3 < TV^f^l. 

(2.17) 

Finally, we bound the last of the third cumulant terms in (|2.15[) as 

|E K^i^fyujRkiRkjRkjVjl = |Ey^ n 3 (i, k)ui(R ki ) 2 {Rv) k \ 

i : j,k:i^k i^k 

< Const(m 5 )Ej2 \u t \\R hl \ 2 \\R N (z)\\ \\v\\ < y/N C ^™*\ (2.18) 

ik \ mz \ 

where we again used (|6.7|) and 

\Rk>\ 2 = \\R N (z)e t \\ 2 < \\Rn(z)\\ 2 < 

Combining the bounds (j2.16H2.18")) . we see that the contribution of the third cu- 
mulant terms to r m in (12. 12112.151) is bounded from above by O I -, \, 7= ) . The 

11 " " J \\3mz\ 3 VN J 

fourth cumulant terms give 

^j^2[18E( ^2 K4,(i,k)uiRnRikRkkRkjVj) + 6E( ^ k)uiRii(R k k) 2 Rij v ) 

(2.19) 

+ 18E( Ki{i,k)ui(R ki ) 2 RkkRijVj) + 6E{ ^ k)ui(Rki) s R kj Vj)]. 

i,j,k:i^£k i,j,k:i^k 

To estimate the absolute value of the first term in (12 . 19[) . we note that 

|E K i (i 1 k)u i R ii R ik R kk R ko v \ = \EY^K i (i 1 k)u l Ri i R lk R kk (Rv) k \ (2.20) 

i,j,k:i^k i^k 

-Const(m^) 



3m z\ 



(z)\\ 2 \\v\ 



< Const(m 5 )E ^ \u t \\\R\\ 2 \R lk \\(Rv) k \J < y/N- 

where we used the bound 

^2\Rik\\(Rv)k\ < \\RN(z)e l \\\\R N (z)v\\ < \\R N (z 

k 

(|6.7|) . and the fact that the fourth cumulants of (Wjv)ifc are uniformly bounded in 
absolute value by some constant Const(m^). 

To estimate the second term in (|2.19[) . we write 

|E ^{hk)u l R H {R kk ) 2 R l3 v 3 \^\&Y J ^{i,k)u l R ll {R kk ) 2 {Rv) l \< (2.21) 

i,j,k:i^£k i^k 

Const (ms) 



Con S t(m 5 )E(J2\Rkk\ 2 \\RN(z)\\J2\^\\( Rv W < V " 

k 



\3mz\ 4 
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The other two terms in (|2.19j) are estimated in a similar fashion. Each of them is 
O ( ^jmlp^ ) ■ Therefore, the fourth cumulant terms give the contribution O ( N \ y mz \ i 
to tn in (|2. 12112.131) . 

Finally, we estimate the error terms due to the truncation of the decoupling 
formula at p = 3 for i ^ k and at p = 1 for i = k. Here, we treat the error term 
due to the truncation of the decoupling formula at p — 3 for i =/= k. The second 
error term can be treated in a similar way. To estimate the error term, we have to 
consider expressions of the following form 

n- 5 / 2 e i«bc*. A?) sixp ]^ii^£Vm^^m-r£5']|-«^ 5 ik-r <b) «)-m , (2-22) 

where a,b,c,d,e, f,p,q,s € the supremum in f|2 . 22[) is considered over the 

resolvents = (z — X^) -1 , I = 1, . . .5 of rank two perturbations X$ = Xpj + 
xE ik of X N with {Eik)jh = Sijdkh + 5 ih 8kj- Estimating each entry of i? w by rg^r , 
taking into account that 

N 

^\ui\<VN\\v\\=VN, 

i=l 

and using the fact that the fifth cumulants of the off-diagonal entries of Wn are 
uniformly bounded, we bound (|2.22[) from above by O (ivpmTp) ■ 

Combining the estimates of the third and the fourth cumulant terms and the 
truncation error term, we can rewrite the Master equation (|2. 12|) as 

zE m = (u, v) + a 2 E ( m tr N R) + ^E ({u, (R N (z)) 2 v)) + O ( L 

(2.23) 

where we recall that by Pi we denote a polynomial of degree I with positive coeffi- 
cients that do not depend on N. 
Since 

\{u,(R N (z)fv)\<\\u\\\\v\\—^, 

we obtain 

zE m = (u, v) + a 2 E ( m tr N R) + O Wl ) \ (2 24) 



<N 

Finally, we have to estimate the term E {rj^tx^R) in the Master equation. We write 
|E (tr N R(u, Rn(z)v)) - ga(z)E(u, R N (z)v) \ < (V ({u, R N (z)v))) 1/2 {Y(ti N R)) 1/2 

(2.25) 

+ \9N(z)-g*(z)\\\u\\\\v\\—^—, (2.26) 
where we use the notation 

g N (z) :=Etr N R N (z). (2.27) 
The variance V(trjv-R/v(z)) has been estimated in Proposition 2 of [33] as 

V(tr N R N (z)) = O ( I ) , (2.28) 
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uniformly on C \ R. It follows from the proof of (|2.28j) that the bound is valid 
provided the fourth moments of the off-diagonal entries are uniformly bounded and 
the second moments of the diagonal entries are uniformly bounded ( [34] ) . Applying 
the bound \(u,R N {z)v}\ < Jgf^L = and ([23]), we obtain 

'PrdJmzl- 1 ) 



E (trjv^u, Rn{z)v}) = g a (z)E{u, R N (z)v) + O 



N 



uniformly on bounded subsets of C\R. This allows us to write the Master Equation 
for T] N = (u( N \R N (z)vW) as 

zEn N = (u, v) + a 2 g a (z)Er lN + O f ^\ 3v ^\ l \ _ ( 2 .29) 



N 

uniformly on bounded subsets of C \ M. Since z — a 2 g a [z) = l/g a (z) and g a (z) is 
bounded, we arrive at 

E{u, Rn(z)v) = g a (z){u, v) + ( Pj ^ J ^ I ) . ( 2 .30) 



<N 

which is exactly the estimate (I2.2[) of Proposition 12.11 

To prove ([231), we note that (12. 25112. 261) . (j2T28|) and ([2JJ) imply 

P 8 {\3mz^) 
N 

uniformly on C \ R. Therefore, one can rewrite (|2.24j) as 

zE m = (u, v) + a 2 g a (z)E m + O f(\z\ + M) Pg(l ^ ^\ , (2.31) 



E (tr N R{u,R N {z)v)) = g a (z)E(u,R N (z)v) + [ (\z\+M) 



uniformly on C \ 1, which implies (|2.3p . 

Now, we turn our attention to the proof of (|2.4p . The key part of the proof is 
the following lemma. 

Lemma 2.1. Let Xjy = -j=Wn be a random real symmetric (Hermitian) Wigner 
matrix defined in U.l\l.$ ( U.3\l.l\j ), R N {z) = (zI N - X N )-\ z £ C \ K, and 
U ( N ) — , . , ,Mjy), v^ N ) = (vi, . . . ,vn) be nonrandom unit vectors in <C N . Then 

(z-v 2 g N (z))V((u( N \R N (z)vW)) = yJ(Y((uW,R N (z)vW))0 (M^LH 



O 



N 

(2.32) 

PedJm^- 1 ) 
N 

uniformly in z £ C \ M, where <?jv(-z) is defined in 

Proof. As always, we will suppress the dependence on TV in u = vS N ' and v = v^ N \ 
and use the notation -q N = (u,Rjy(z)v). Clearly, Y(r]jy) = E\?]n\ 2 — |E?7jv| 2 , and 
{u, Rn(z)v) = (v,Rn(z)u). We start with the following form of the Master equation 
for r/ N , 

zEi]n — (u, v) + (7 2 gM{z)EriM (2.33) 
+ ^E( J2 ^(hk)u i R u (z)R kk (z)R kj (z)v j )+o( J 5 ^ 3 ^ } ) , (2.34) 

i,j,k:i^k 



16 A. PIZZO, D. RENFREW, AND A. SOSHNIKOV 

uniformly on C\R. We singled out in (|2.34[) the only term in which is 0(N^ 1 / 2 ), 
namely (|2.17|) . As we have shown above, all other terms in tm can be estimated as 
O ^ P5 (I J ^I H j , Multiplying both sides of the equation by W)n, we obtain 

zlE^I 2 = (u,v)W^ + a 2 \E m \ 2 g N (z) (2.35) 
+ N^/2^( X! K 3 (i,k)uiRii(z)Rkk(z)Rkj(z)vj)TS.r] N + O 1 " { 



i,j.k:i^k 



N 



Our next goal is to obtain the Master equation for zE(\r]N\ 2 )- As before, we use the 
resolvent identity (|6.3[) to write 

zE(\r] N \ 2 ) = zE[^2'iIiR ij (z)v : jr] N '} = E[^Uj (<% + X ik Rkj(z)) UjJjjv] (2.36) 

ij ijk 

= (u, v)ErjN + / ] UjVjE(XikRkj (-g)W)- (2.37) 

ij'fc 

Applying the decoupling formula (|6.1|) and (|6.4H6.5[) to the term E(XikRkj{z)rjN~) 
in (|2.36M2.3"7| . we obtain 

2 

zE(M 2 ) = (u,v)WjN~ + a 2 E (\ m \ 2 tr N R N (z)) + ^E ((u, (R N (z)) 2 v)rJw) (2-38) 
\- V[(Wv)«] ~ 2a 2 _ a 2 / 

2_, UiVjElRuRijTiN} + —E \ 2_ UiRkjizjvj 

(2.39) 

+ 2_, — ]v — p^W^ — — J + rw ' t 2 - 40 ) 

where r^r contains the third and the fourth cumulant terms corresponding to p = 2 
and p = 3 in (|6.ip for k = i, and the error due to the truncation of the decoupling 
formula (|6.1[) at p = 3 for k ^ i and at p = 1 for k = i. Clearly, 

««, (iMz))Mw) - (pd^) • ( 2 - 41 ) 



For fc 7^ z, we have 
Rn{z)u) 



dX ik 
d 2 (v,R N (z)u) 
9Xf k 



= (R N (z)v)i(R N (z)u)k + (Jfo(z)»)k(iM*)«)i> ( 2 - 42 ) 



2J2«(2)(ito(z)v)*(.Rjv(2)tt)k (2.43) 



+ 2i? jfc (z)(i? 7 v(z)w) l (i? 7V (z)u) fe + 2R lk {z){R N (z)v) k {R N {z)u) % (2.44) 

+ 2R kk (z)(R N (z)v)i(R N (z)u)i, (2.45) 

d(RN(z)w)i = Ru{E){RNiE)w)k + Rik ( 2 )(R N (z) w ) t . (2.46) 
O^ik 
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For k = i, we have 

d(v, Rn(z)u 



d 2 (v, Rn{z)u) 
d(R N (z)w) 



{R N {z)v)i(R N (z)u)i, (2.47) 
2 -2R H (z)(R N (z)v) l (R N {z)u) l , (2.48) 



= R li {z){R N (z)w) l . (2.49) 



Using (|2.42|) and (|2.47|) . one can write the last term in (|2.39j) as 

- » /v> d(v,R N (z)u) \ 
N~ \ 2. dXik (2-50) 

^i,j,k:i^k J 

T7 E \ ^ "i-Rfcj(^)«i[(-Riv(^)u)i(Jljv(^)w)fc + (RN(z)v) k (R N (z)u)i] ) = O i — — 

N J Vl*H 3 ^ 

(2.51) 

The third cumulant terms in in (|2.40p can be written as 



^ £ „ 3 M)^E ( d2 ^ 2 {z) (v,RN{z)u)) (2.53) 

i,j,k:i=jtk \ ik / 



A/-3/2 

+ ^y- 2 E «3(i,wL( / ( y> ). (2.55) 

We are going to estimate the terms (|2 .53112 .55")) separately. We start with the 
last two. We claim that both (|2.54[) and (|2.55[) are O Ugmz^jv )- Indeed, consider 
first (|2~54f . It follows from (|6.4«6.5j) . ([2^g2]) . and ([2^47]) . that it is equal to 

2N 3/2 E K 3(«,fc)"^j]E([i? fcfc (z)i? lJ (z) + i? jfe (z)i? fc:) (z)][(i?Ar(z)'y)i(i? A r(z)u) jt + (R N (z)v) k (R N (z)u) 

i,j.k:ij^k 

(2.56) 

Let us estimate the term 

2^3/2 E ^3{i,k)u i v ] ¥ J (R kk (z)R i] (z)(R N (z)v)i(R N (z)u) k ) (2.57) 

in (|236| . 

We note that the Euclidean norm of the vector in with the coordinates 
K3(i,k)ui(RN(z)v)i, 1 < i < N, i ^ k, and for i = k is bounded from above by 
C °jm ( r| 5) - Thus ' it; follows from (ETTJ) and ||u|| = 1 that 

| J2 KtihQuiiRNmwRtiWl < C '^£^ ) - (2-58) 
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In addition, 

N i N ( PR \ 

Y,\Rk k (z)\\(R N (z)u) k \ < ^L-^\(R N (z)u) k \ =° ( j^p J ■ ( 2 - 59 ) 

Combining (^3^ and ((535)1 . we estimate (g37j) as O ( jj^rjv j • The other 
terms in (12.56)) can be estimated in a similar way, which implies that (12.541) is 

° (|3m^iv)- 

Now, we turn our attention to (|2.55p . Using (j2.43H2.43)) and (|2.48p . one can 
rewrite (|2.55p as 



N 3 / 



1/2 X! K 3(«,fc)'«iW : ,-E[i? fei (z)i?i i (z)(i?Ar(z)w) fc (i?jv(2)M)fe] (2.60) 



i,j 7 k:i^k 



TV 3 / 



X! K 3(«, fc)ui«j]E[ J R fe:) (z)i?. ifc (z)(i? A r(z)w) i (i? 7 v(2 : )w)fe] (2.61) 



i,j,k:i^k 



N 3 / 



3/2 X! K 3(«, fc)UiWjE[i? fc:) (z)i?i fc (z)(i?Ar(2:)w) fc (i?Ar(z)M) i ] (2.62) 



i,j,k:i^k 



+ X! K 3(«, fc)"i«jE[-Rfcj(2)-Rfcfc(^)(i?Af(z)w)i(i?A'(2)w) 4 ]- (2.63) 

i,j,kxi^=k 

We estimate (|2.60p . The subsums (|2.6HI2.6!5)) can be estimated in a similar way. 
The summation with respect to j in (12.60)) gives 

— ^ y^E[/t 3 (i, k)ui(R N (z)v) k Rii(z)(RN(z)v)k(RN(z)u)k\. 

Now, we estimate 

2l*3(i,fc)(^(*)«)fc||(ieiv(«)i;)fc||(i2iv(2)u) fc | < C ™<™*\ an d 



,3mz|' 

iV 



Combining the last two bounds, we obtain that (|2~60| is O (jj^r^ 
Finally, let us estimate (|2.53p . It can be written as 

i-^4E I K3(i,k)u i R ij (z)Rik(z)Rkk{z)v j (v,R N (z)u)\ (2.64) 



2!7V 3 / 
1 



2!iV 3 / 2 ' 
1 

2\N 3 / 2 ' 



,i,j,k:i^k 

^ K 3 (i,k)u i R il (z)R kk (z)R kj (z)v : j(v,R N (z)u) J (2.65) 

i i,j,k:i^k 

^ «3(i, k)uiRki(z)Rki{z)Rkj(z:)vj(v,R N (z)u) J . (2.66) 

y i,j,k:i^k 



The subsums (|2.64j) and (|2.66j) are bounded from above by O ( | j m z| 4 jv ) • ^ nc 
calculations are very similar to the ones used above and are left to the reader. The 
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subsum ()2.65p can be written as 



E 



\ i,j,k:i^k J 



To estimate it, we write 

l E (lu372^ ^2 K 3{i, k)uiRii(z)R k k{z)Rkj(z)Vj)W)) - j^/2^-( ^2 K 3(i,k)uiRi l (z)R kk (z)R kj (z)v J )Er]W\ 

i,j,k:i^k i,j,k:i^k 

(2.67) 

<^(V( ^{hk^R^Rk^Rk^Vj)) 1 ' 2 {Y{ m )) 1/2 . (2.68) 

It follows from the estimates in (|2.17|) that one has a deterministic upper bound 
X! K 3(i,k)uiRii(z)Rkk(z)Rkj(z)vj\ < const— — ^ 

Thus, 



N3/2 v ' v 1 " r, "" w ^ l ' JI " |3mzpVjV 



E 



^ttj X! K3 (*' k ) u i R ^( z ) R kk(z)Rkj{z)v j i] N ) J (2.69) 



i,j,k:i^k ' IV- 7 

(2.70) 



Combining the estimates (|2. 53112. 701) , we obtain that the third cumulant term (I2.52[) 
contributing to rjy in (|2.38l) can be written as 

-^^■E( ^ K 3 (i 7 k)u l R i i(z)R kk (z)R k: j(z)v j )Ef]W (2-71) 

i.j,k:i^k 

+ o( 1 —f=)(^Vn)) 1/2 + o(—^-—). (2.72) 

\\Jmz\ 3 VNj \\3mz\*N) v 1 

Somewhat long but straightforward calculations using (|6.4M6.5[) and (|2. 42112. 51]) 
show that the fourth cumulant term in rjv in (|2.38[) can be estimated from above 

by O ^| j m "^|Bjv ) • Since the calculations are very similar to those in (|2.19I l2~2"Tj) . 
we leave the details to the reader. In a similar fashion, the error terms in rjy, due 
to the truncation of the decoupling formula at p = 3 for i ^ k and at p = 1 for 
i = k arc bounded from above by O ( j j m "^|6jv ^ • The considerations are similar to 
those given in the analysis of (I2.22[) . 

Combining (|2.41l) . (I2.50H2.51]) , (|2.7HI2.72")) . and the bounds on the fourth cumu- 
lant term and the error terms discussed in the above paragraph, one rewrites the 
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Master equation (|2.38H2.39[) as 

zE(\ m \ 2 ) = (u,v)E^ + a 2 E(\ m \hr N R N (z)) (2.73) 

+ /p 6 (P r ny (2J5) 



Using (|2.28p . we estimate 

|2n\ 1/2 ,,,, „ D 1 



|E (\ m \ 2 tr N R N (z))~g N (z)E\ m \ 2 \ < (V(M 2 )) V2 (Ytr N R N (z)) 1/2 = O 
This allows us to write 

zE(\r] N \ 2 ) = (u,v)Er] N + a 2 g N (z)E\i] N \ 2 + j^j^^i K3 ^' k)u l R li (z)R kk (z)R kj (z)v j )ErJW 

i.j,k:iy£k 

(2.76) 

Subtracting the r.h.s. in (|2.35|) from the r.h.s. in (|2.76p . we obtain (|2.32[) . Lemma 
12. H is proven. □ 

Now, we are ready to finish the proof of Proposition ^. II To obtain the estimate 
(|2.4j) from (|2.32l) . we use the same arguments as in Section 3 of [25] and Section 2 
of [3T]. We note (see e.g. (3.9) in [2SJ) that 

g N (z)(z - a 2 g N (z)) = 1 + ( P4(l y 1} ) . (2.77) 

We define 

O n := {z : \ 3mz\ > LN~ 1/4 }, 

where the constant L is chosen sufficiently large so that the O ( J " z ^ — term 

on the r.h.s. of (|2.77[) is at most 1/2 in absolute value. Multiplying both sides of 
(|2.32p by gjv(z), and using (j6.8[) , wc obtain that 

Y((u( n \Rn(z)vW)) = ^Y((u( N ),R N (z)v( N )))0 (M? m 1 
'Pii^mz^Y 



N 



(2.78) 



for z e Oat. It follows from (|2~78l) that 



V(( M W,i? w (^ (A °» = O ( M^^LT j for z g Gjv . (2.79) 

On the other hand, if| Jmz| < LiV -1 / 4 , then jyT^p > 1- Since {{u^, R N (z)v^)\ < 
we have 



I 3mz\ 



V«„<»>, *„(,)„<»>»< ' Sjf^p,. (2-80) 
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for z such that |3mz| < LA^ 1 / 4 . Combining ([2J9]) and ([2780}t . we obtain (12.41) . 
This finishes the proof of Proposition 12.11 □ 



3. Proof of Theorem 1.6 

Proof. Our exposition follows closely the ones in Section 3 of [31] and Section 4 of 
[28] . In order to extend the estimates of Proposition 12 . 1 1 to a more general class of 
test functions, we use the Helffer Sjostrand functional calculus (see [23], [IB"]). 

Let I be some non-negative integer, and / G C l+1 (R) decay at infinity sufficiently 
fast. For any self-adjoint operator X we can write 

t(TT\ 1 f 9 f 1 A A 9 f 1 f 9 f ,- d f\ ,, n x 

where: 

i) z = a; + with x, y € R; 

ii) f(z) is the extension of the function / defined as follows 

/» : =(£Z!?!(EjW )„(„); (3,, 

here a € C°°(R) is a nonnegative function equal to 1 for \y\ < 1/2 and 

equal to zero for \y\ > 1. 
The integral in (13. ip does not depend on the choice of I and the cut-off function 
(see e.g. [16]). Using the definition of / in (|3.2I) one can easily calculate 

df 1/0/ .0/\ 

ai = 2fe + ^J (3 ' 3) 

n=0 y 

and derive the crucial bound 

^(x + iy^K^max^WllKjKl+ljlvl 1 . (3.5) 
For X = X N , dU implies 

(u, f(X N )v) = -- [ ^(u,R N (z)v)dxdy. (3.6) 

7T J c dZ 

To prove (|1.34j) . we let i = 7 in (|3.2[) and assume that / has compact support. It 
follows from (f2~2j) that 

E(u, f(X N )v) = -E- / %(u,R N {z)v)dxdy (3.7) 

7T Jc dz 

= --(u, u) / ^J= g a {z)dxdy - - [ ^= e u . v {z)dxdy (3.8) 
7r J c dz 7r J c dz 

= (u,«) [ f(x)d(j, sc (x) - - [ ^e u ,v(z)dxdy (3.9) 

J 7T Jc OZ 



where 



\^)\<C 2 ^=^ (3.10) 
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uniformly on {z : fHez £ supp(f), \ 3mz\ < 1}, and C2 is a constant depending on 
supp(f). We conclude that the second term on the r.h.s. of (|3.8[) can be estimated 
as follows 

I- / -z= e u ,v(z)dxdy\ < - / |ttt e u>v (z)\dxdy (3-11) 

< CiC 2 ||/|| c . ([ _ £iZ]) -^ y ^/(z) y d2/X CT (y) (3.12) 

where x/ an d Xo- are the characteristic functions of the support of / and of a 
respectively, and L is such that supp(f) C [—£,£]. This proves (|1.34[) . 

To prove (|1.35p . one considers / G C 9 (R) (so 1 = 8) such that 11/119,1.+ is finite. 
Using (|2.3p . one replaces the estimate (|3.10[) with 

|e u ,„(*)| < cJ-^J^Psdlmzr 1 ), (3.13) 
V TV 



valid on C \ K, which leads to 

\- I %e U;V (z)dxdy\<- [ \% e u , v (z)\dxdy < <7 1 <7 3 ||/||9,i, + ^=. (3.14) 
ir J c dz 7T J c dz y/N 

To prove (|1.33p . we consider / G C 5 (R) such that ||/||5,i < 00, and let I — 4 in 
dHU). Then 

— — o / / Cov ((u, R N (z)v), (u, R N (w)v)) dxdydsdt, 

n Jc Jc dw 

where z = a; + iy, w = s + it. Taking into account (|2.4I) . we get 

V(( W) /(Xjv)w)) < ^ / [ \ %\\^r\Vm^RN (z)v))y/V((u, R N (w)v))dxdydsdt 
7T Jc ic oz dw 

-^(jj%\ p4{rjmzndxdy ) ■ (3 - 15) 

Plugging (I3.5P with Z = 4 in (|3.15[) . we prove (|1.33p . Thus, we have proved the 
parts (i) and (ii) of Theorem 1 1.61 

Now, let us assume that the marginal distributions of the entries of Wn satisfy 
the Poincare inequality (|1.12p with a uniform constant v and prove the parts (iii)- 
(v), i.e. the estimates (fi~37l) . (fl~39l) . and (fl~40| . Since the proof of (jl.37Hl.40j) is 
very similar to the proof of Proposition 3.3 in |31j . we discuss here only the main 
ingredients. 

The first important observation is that if f(x) is a Lipschitz continuous function 
on M. with the Lipschitz constant |/|,c,r then on the space of the TV x TV real 
symmetric (Hcrmitian) matrices, the matrix-valued function f(X) is also Lipschitz 
continuous with respect to the Hilbert-Schmidt norm f |15j. Proposition 4.6, c)). 
Namely, 

\\f(X)-f(Y)\\ HS <\f\cM\X-Y\\ HS , (3.16) 
where the Hilbert-Schmidt norm is defined as 

\\X\\ HS = (Tr(|X| 2 )) V2 . (3.17) 
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In particular, if u and v are unit vectors, then 

G(X N ) = (u,f{X N )v)) (3.18) 

is a complex-valued Lipschitz continuous function on the space of N x N real 
symmetric (Hermitian) matrices with the Lipschitz constant 

\G(X)-G(Y)\ 

\G\c ■= supx^Y—rr^ TTTi = |/k,R. 

II-* - Y Whs 

The second observation is that joint distribution of the matrix entries 

{X ih l<i<N, V2X jk , l<j<k<N} 

of Xjsr satisfies the Poincare inequality with the constant \Nv since the Poincare 
inequality tensorizes f |21|. [T]). Therefore, for any complex- valued Lipschitz contin- 
uous function of the matrix entries with the Lipschitz constant \ G\c, the distribution 
of G(Xn) has exponential tails (see e.g. Lemma 4.4.3 and Exercise 4.4.5 in [1]), i.e. 

V(\G(X N ) - EG(X N )\ >t)< exp . (3-19) 

where K is a universal constant, 

K = -^2 i log(l - 2- 1 A- 1 ). 

i>0 

This proves {OS}. 

Applying (j3. 19[) to the spectral norm ||X|| of the matrix Xm and using the 
universality results for the largest eigenvalues (see [24] and references therein), we 
obtain 




P(|prjv||-2£r| >t)< (2^ + o(l))exp ( -—^1 j . (8.20) 
and, in particular, 

P(||Xjv|| > 2cr + 5) < (2K + o(l)) exp I - ^^--5 1 . (3.21) 




Let f(x) be a real-valued Lipschitz continuous function on [—2a — 5,2a + 5]. 
Then, we can find a function fi(x) that is Lipschitz continuous on R, coincides 
with / on [—2a — 6,2a + S], and satisfies |/i]£,r = It follows from p.21j) 

that (u, J(Xn)v)) does not coincide with (u, Ji(Xn)v)) on a set of probability at 

most (2K + o(l))exp y-^-t) , which implies (fl~37]) . The details are left to the 
reader. □ 

4. Outliers in the Spectrum of Finite Rank Perturbations of Wigner 

Matrices 

This section is devoted to the proof of Theorem 11.21 
Proof. For x G (2<j, +oo), 
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is decreasing and g a (2a + 0) = 1/a. Let us choose 5 > in such a way that 

> (o \ on for a11 1 ^ 3< J*+ , (4-2) 
g a \2a + 25) 

i.e. for all 9j that correspond to the outliers (so 8j > a). Let 

L := max(6j, 1 < j < J CT + ) + 2a + 28. 

It follows from 1111. 41) (see e.g. [4], [2], and the proof of Proposition 2.1 in [28] ) 
that there exists a random real symmetric Wigner matrix Wn that satisfies ()1.1I 

H3D, 

P(Wjv = Wat) 1 as N -> oo, 

and 

||WW/V^V|| ->• 2<r a.s. (4.3) 
Without loss of generality, we can assume that Wn = Wn, so 

\\W n /VN\\ ->• 2a a.s. (4.4) 



It follows from the definition of M^r and (|4.4I) that, with probability 1, the deformed 
random matrix Mjv = jjWn + A/v has no eigenvalues bigger than L for all but 
finitely many N. Let u — u^ N \ and v = be nonrandom unit vectors in C N . 
Define 



£, N {x) := ViV ((u, Rn(x)v) - g a (x)(u, v)) , x e [2a + 26, oo), (4.5) 

Cjv(x) :=^^ = -ViV : (( W ,i?2,( a ;H+. 9 ;(x)( U ,«)), z G [2a + 25, oo), (4.6) 

|^(a;):=VJV : (<u,(fc(^Jv)i2w(a;))«)-fo(a!)<u,w)), x G [2ct + 2<5, oo), (4.7) 

oo), 
(4.8) 



Cw(ar) := = ->/5v((u, (M^aO^a^))") + <£(z)(u,u)) , i€ [2a + 25,oo), 



where h G C£°(R) such that 

h(x) = 1 for x G [-2a-<S/2,2a + <5/2], (4.9) 
ee for a; £ [-2a - 8,2a + 8\. (4.10) 

We claim the following lemma. 

Lemma 4.1. 

P (max (| Cat (s) |, a; G [2a + 25, L]) < log(iV)iV 1/6 ) -> 1, (4.11) 
where (,n{x) is defined in 



Proof. It follows from (|4.4|) that £/v(x) = £,n(x) and CwO^) = CaK^) f° r au x € 
[2a + 25, L] and all but finitely many N almost surely. Thus, it is enough to prove 
the result of the lemma for £at(x). Consider an equidistributed finite sequence 



x = 2a + 25 < x\ < X2 < ■ ■ ■ < xu 



N)i 
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where x l+ i - x t = N 1/3 , < i < l(N) - 1, and a^jy)-! < L < xi(N). Clearly, 
the number of elements in the sequence is 0{N^ 3 ). We have 



(|Ctf(zi)|, < * < KN) ~ 1 ) > l l°g(W 1/6 | (4-12 



W , , , 

< \\C N (xi)\ > ^logW^/e 

i=0 ^ ' 



(4.13) 



1 ' W / / \2\ 

* (i^iVV3 g (nCN^)) + (e(Ca(^))) J ■ (4-14) 



It follows from Theorem 11.61 that 

V(6v(^)) = 0(1), (4.15) 

E(6v(*i)) - 0(1), (4.16) 

uniformly in < z < l(N) and iV > 1. Indeed, 

C N (x) = x/iV - (u,v) J* /<*>(*)d/i. c (t)) , (4.17) 

where 

fi*)(t) = -h{t)j^-^ (4.18) 

is a C^°(M) function such that ||/^||5,i and ||/^||c 8 (K) are uniformly bounded in 
x e [2ct + 2(5,oo). Thus, (|4.15I4.16|I follow from fjl.33lll.34p . The bounds (|4.12M.14p 
and (j4.15l4.16p then imply 

ax(|Ov(**)l, < i < l(N) l) > Uog(N)N^ < (4.19) 

Taking into account that | ^%jr^ I < const\/~R\\u\\ \\v\\ , we arrive at (|4.1ip . Lemma 

14.11 is proven. □ 



Now, we are ready to start the proof of Theorem ll.2l Let us denote by i/ 1 ' , . . . , vS r > , 
the orthonormal eigenvectors of An corresponding to the non-zero eigenvalues. We 
recall that we used the notation 9\ > . . . > 0j o = > . . . > 6 j for the (fixed) eigen- 
values of An, and denoted the (fixed) multiplicity of 6j by kj. The zero eigenvalue 
9j = has multiplicity N — r. Clearly, J2j^j % = r - Let us denote by the r x r 
diagonal matrix built from the non-zero eigenvalues of A n, 

6 := diag(6i, . . . ,9i, . . . ,6j -i, . . . ,6j -i,6j 0+ i, . . . ,6j +i, ...,6j,... ,0j). (4.20) 

Let us also denote by Un the N x r matrix whose columns are given by the or- 
thonormal eigenvectors . . . , of A jy. Clearly, 

A N = U N eU* N . (4.21) 

For any x £ [2a + 25, L], we define the r x r matrix Sjy(a;) as follows. Let 

(Hjv^))^ ^VN^iRNWu^-g^Sij) , 1 < i,j < r. (4.22) 

The first step in the proof of Theorem 11.21 is the following lemma from [S] . 
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Lemma 4.2. Suppose that x is not an eigenvalue of Xn- Then x is an eigenvalue 
of Xn + Am with multiplicity n > 1 if and only if ga(x) is an eigenvalue of the 
r x r matrix 

Z N (x) := 9- 1 - -^Z N (x), (4.23) 



<N 

with the same multiplicity. 

For the convenience of the reader, we sketch the proof of Lemma f4 . 2 1 below . 

Proof. Let x £ Sp(Xn). Therefore Rn(x) = (xIn — Xn)^ 1 is well defined, and 

detp^Ar +A N - xI N ) = det ((X N - xI N )(Id N - R N (x)A N )) . (4.24) 

We obtain that for x ^ Sp(Xn) that x € Sp(Xn + An) if and only if 

det (I N - R N (x)A N ) = det (I N - R N (x)U N eU* N ) = det (I r - eU* N R N (x)U N ) = 0, 

(4.25) 

where one uses the identity det(I — BC) — det(7 — CB). Rewriting 

i r - qu* n r n {x)u n = -e (u* n r n (x)u n - e- 1 ) , 

one finishes the proof of Lemma 14.21 □ 

Proposition 11.11 plays an important role in the proof of Theorem 11.21 Before we 
prove Proposition 1 1.1[ we need to introduce some notations and prove Lemma l4.3l 

Consider a family of r x r matrices Z^(x) defined in (j4.23[) for x € [2a + 26, L}. 
Fix an eigenvalue 9j of An such that 9j > a and use the notation . . . , v^ k ^ 
for the eigenvectors of An that correspond to the eigenvalue 9j. Without loss of 
generality we can assume that j = 1. We do it just to simplify notations. The 
case 1 < j < J a + is identical. We recall that is defined in (|1.25|) as the 
kj x kj submatrix of S^v (pj ) restricted to the rows and columns corresponding to 
uW, 1 < i < kj. The central role in the proof of Proposition 11.11 is played by the 
following lemma. 

Lemma 4.3. Let Zn(x), x € [2a + 28, Z], be as in \4.23\) , with 2jv(x) defined in 
UMj , and 6 defined m UJUj) . Let 

zi(x) < z 2 (x) < ... < Zr(x) (4.26) 

be the ordered eigenvalues of Zn(x). Then, for sufficiently large constant C > 0, 

log N \ 
\z l (x)-z l (y)\<Cj^\x-y\, Vz, y e [2a + 25, L], i = 1, . . . , rj 1, 

(4.27) 

as N — » oo, and 

Zi(pi) = ^ + 0(-^=), l<i<h, (4.28) 



h WN 

in probability, i.e. \/N(zi(pi) — j^) is bounded in probability, 1 < i < k\. 
Below, we prove Lemma 14.31 



Proof. We claim that (|4.27|) follows from Lemma 14.11 Indeed, (|4.11j) and (|4.6j) 
imply that 

¥(\\~ N {x)-~ N {y)\\ ^Constlogi^N^^x-yl Vx, y € [2a + 26, L]j -> 1, 

(4.29) 
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as N -> oo. Since |z,(a;) - Zi(y)\ < \\Z N (x) - Z N (y)\\ = -^\\E N (y) - E N (y)\\, 1 < 
i < r, we conclude that (|4.29j) implies (I4.27|) . 
To prove (|4.28l) . we use the fact that 

\\z N (x) - e- 1 !! = 11-^=3^(1)11 = 0(4=), (4.30) 



'N VN 
in probability. Indeed, the entries of the r x r matrix 3w-(a;) are bounded in prob- 
ability since the expectation and variance of 

i%(x) := ((u^,h(X N )R N (x)uW) - g„(x)6^ ,l<i,j<r, 

is uniformly bounded by Theorem [L6l and 

Z%(x)=$(x), l<i,j<r, 

almost surely. Thus, ||3jy(a;)|| is also bounded in probability. Since the first kx 
eigenvalues of _1 are equal to we obtain (|4.28[) . Lemma |4~31 is proven. □ 

Now, we are ready to prove Proposition ll.il 

Proof. By Lemma l4~2l the outliers of + are given by those values of 

x E [2cr + S,M] such that 

g a (x) = Zi(x) for some 1 < i < r. (4-31) 
We recall that g a (x) is a monotonically decreasing function on [2a + S, M] and 

g' a (x) < const(cr, 6, M) < 0, x £ [2a + S, M]. (4.32) 

We note that 

9M = (4-33) 

Since for 1 < i < kx, (14.281) gives us that Zi(px) — g a {pi) — 0(^t=} in probability, 
it follows from (|4.31[) and (|4.27l) that with probability going to 1, there exist M > 

xi > x 2 > ■ ■ ■ > Xk t > 2a + S such that g a {xi) = Zi{xi), 1 < i < kx, and 

VN\x t -px\= 0(1), l<i<kx, (4.34) 
in probability. Applying (I4.27|) one more time, we get that 

gM) = *i(Pi) + O ( l0g( y 1/6 ) ,l<i<kx, (4.35) 

in probability. By a standard perturbation theory argument (see e.g. section XII. 1 
in [32]), one proves that the first fci smallest eigenvalues of the matrix Zjy(px) differ 
from the (increasingly ordered) eigenvalues of the k± x k\ matrix ^-Id — -^=S^ n '* by 

at most O (jj) , in probability, where the matrix E^ has been defined in (jl.25l) . 
To see this, we use the following standard lemma from the perturbation theory 

Lemma 4.4. Let B be an n x n real symmetric (Hermitian) matrix that can be 
written in the block form as B = (-By)i j=i,2i where Bij is an n, x rij matrix. 
Suppose that all eigenvalues of Bn are smaller than all eigenvalues of B22 and the 
gap between the spectra of Bu and B22 is at least Const > 0. In addition, suppose 
that the operator norm of the off diagonal block B\2 is bounded from above by e, so 
that \\Bx 2 \\ = \\B 2 i\\ < e. 

Then there exists const(Const,n) such that the first n\ smallest eigenvalues of 
B differ from the (increasingly ordered) eigenvalues of Bn by at most conste 2 . 
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Proof. We sketch the main idea of the proof for the convenience of the reader. 
First of all, one can assume that the eigenvalues of Bu are degenerate. In addition, 
one can assume that the blocks Bu and B22 are diagonal matrices. If not, one 
can simultaneously diagonalize them without changing the bound on the operator 
norms of the off-diagonal blocks. Thus, Bu = diag{X\, A2, . . . , A ni ), and B22 = 
diag(X ni+ i, . . . , A n ). Then the eigenvectors of Bu are given by ex, . . . e ni , and the 
eigenvectors of B22 are given by e ni +i, . . . e„, where ej, 1 < i < n are the standard 
basis vectors in C n . We recall that 



Ai < A2 < . . . A ni < A ni +i < ... A 
and A ni +i — A ni > Const. Then it is easy to see that 

((S-AiKe,-) 



n ■ 



ei = ei 



E 



Ai — A,- 

3>ni J 



is an approximate eigenvector of B with the approximate eigenvalue Ai such that 

{ B-X 1 )e~ 1 =Y, {{B ^: ej) {B-X j )e j . 

j>ni 3 

Since \\(B — A 3 -)ej|| < e, 1 < j < n, and Xj — Ai > Const, m < j < n, we obtain 
that 

|| (B - Ai)e~i|| < conste 2 , 

where const depends just on Const and n. The last inequality also holds (albeit 
with a different value of const) if one replaces e% by the normalized vector jjfnr- 
Thus, B has an eigenvalue in the conste 2 - neighborhood of A*. □ 

The result of the lemma can be immediately extended by induction to the case 
of rn x m block matrices B = (-Bij )i<ij< m . To apply it in our setting, we note that 
the fci x fci matrix j^Id — ^^S^ 1 ' is the upper- left block of Zn(pi). The other 

diagonal blocks of Zn(pi) are given by sjj- , 1 < i < m — 1 defined in (ll.25[) . Since 
the operator norms of the off-diagonal blocks of Zn(pi) are Q(A^~ 1 / 2 )f5ee (|4.30l) ). 
the desired statement follows. 
Therefore, we have 

1 1 floe(N)N 1/6 \ 
9a{pi) + (g'M + o(l)) {Xi - Pl ) = -j-- -j= Vl + O ( J ,l<i<k u 



(4.36) 
'repo- 
sition O now follows from (|4~36)) and (|4~33l) . □ 



where y\ > . . . > are the eigenvalues of the matrix 5 ^ . The result of Propo 



Since the eigenvalues of the matrix 3jy (p±) are bounded in probability, the first 
part of Theorem 11.21 i.e. (| 1 . 14|) . follows from (| 1 . 26[) in Proposition ll.il 

Now assume that the marginal distributions of the matrix entries of Wn satisfy 
the Poincare inequality (|1.12p with a uniform constant v. Our goal is the almost 
sure bound (|1.15|) on the rate of convergence of the outliers. We note that one 
can improve (I4.27[) and (|4.28l) in Lemma 14.31 as follows. Applying (|1.39II1.4U|) to 
(|4.17|) and taking into account that £n(x) — £n(x) and (n(x) — (n(x) for all 
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G [2cr + 26, L] on a set of probability 1 — [cup y — , j j , one proves 

max (| Cat (a;) |, a; G [2tr + 25,L]) < Consh log(iV), (4.37) 
max ( |z. ( (x) - ^-|, a; G [2a + 2(5, L] ] < Const 2 log (ffl , 1 < i < fci, (4.38) 

V "i 7 va 



almost surely, where Const\ > 0, Const2 > are sufficiently large, improving 
(|4.11[) . Reasoning as before, (|4.37[) implies that 

|*i(a>) - Zi(y)\ < Const 3 ^)=+\x - y\, Vz,yG [2a + 2<5,L], i = l,...,r, (4.39) 

v A 



almost surely for sufficiently large constant Const^ > 0. Thus, we have 

gM+tfM + o(i)) ( Xl - Pl ) = ±--L yi +o O^p-) ,i<i<h, (4.40) 



almost surely, which implies ()1 . 15[) since g a (p x ) — i. Theorem O is proven. □ 

5. Proof of Theorems 1.3, 1.4, and 1.7 

In this section, we prove Theorems EH EH and We start with Theorem 
IP 



Proof. Let 8j > a be an eigenvalue of An with the multiplicity kj. Let us assume 
that Case A takes place. Thus, without loss of generality, we can assume that the 
eigenvectors of An corresponding to the eigenvalue 6j belong to Span(ex, ■ ■ ■ , e^ ), 
where Kj is a fixed positive integer. As always, we consider the real symmetric case. 
The treatment of the Hermitian case is very similar. Consider a Kj x kj matrix Uj 
such that the (ifj-dimensional) columns of Uj are filled by the first Kj coordinates 
of the kj orthonormal vectors of A jy corresponding to the eigenvalue 0j . We recall 
that the remaining N — Kj coordinates of these orthonormal vectors are zero. Let 
us denote by Flffi^z) the upper- left Kj x Kj submatrix of the resolvent matrix 
Rn(z) = (zIn — A/v) -1 . Finally, we define the random matrix- valued field 

T N (z):=VN^R N <j \z)-g (T (z)I Kj '), z G C \ [-2a, 2a}. (5.1) 

It follows from the definition that T n(z) is a random function on C\ [—2a, 2a] with 
values in the space of complex symmetric Kj x Kj matrices. In particular, T^(x) 
is real symmetric for real x G M \ [—2a, 2a]. It follows from (|1.25l) . (|5.1I) and the 
definition of Uj in the paragraph above (|5.1I) that 

B# = U*T N { P j)Uj. (5.2) 

We recall that Theorem 1.1 in |3T] states that Tjv(z) converges weakly in finite- 
dimensional distributions to a random field 

T(z):=gl{z){w^+Y(z)), (5.3) 

where W<- K ^ is the Kj x Kj upper-left corner submatrix of the Wigner matrix 
W N (ll.llll.4L and Y(z) = (Y a (z)), Y u (z) = Y u {z), 1 < i,l < Kj, is a centered 
Gaussian random field with the covariance matrix given in (1.18)-(1.23) in [3T]. In 
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particular, for real iel \ [— 2a, 2a], one has that the entries Yu, 1 < i < I < Kj 
are independent centered Gaussian real random variables such that 

W(Y u (x)) = m(ji)£(x) - 2a 4 g'Jx), l<i< K h (5.4) 

W(Y u (x)) = -a 4 g' a {x), 1 < i < I < K 3 . (5.5) 

Now, Theorem 11.31 follows from Proposition II . II in this paper, and Theorem 1.1 in 
[3"Pj . since 

9<r{pi) = j:, and g' a {p 3 ) = —jp—^ - (5-6) 

j 

Theorem II. 31 is proven. The proof of Theorem II .41 is very similar to the given proof 
of Theorem ll.3l One has to use Theorems 4.1 and 4.2 and Remark 4.1 in [35] that 
generalize Theorems 1.1 and 1.5 in |31j to the non-i.i.d. case, and replace K4(/x) in 
(|5T4|) with 1 <i< Kj. □ 



Now, we turn to the proof of Theorem 11.71 

Proof. Recall that we extended the definition of T(z) to that of an infinite-dimensional 
matrix T(z) pq , 1 < p, q < oo, in the paragraphs above the formulation of Theorem 
11.71 We employ a standard approximation argument. For simplicity, we assume 
that 3mz ^ 0. If z is real, one has to replace Rn{z) by /i(X/v)i?Ar(z), where 
h 6 C^°(R) is defined in (|4.9ti4. 10]) . Let n be a sufficiently large fixed positive 
integer and consider N > n. It follows from Theorem 1.1 in [31] (see the proof of 
Theorem II .31 above) that the joint distribution of 

(u<"\ = (<u<»\ Jto(aOuW) - g a (z)(u^\u^)) , 1 <p,q < I, 

converges weakly to the joint distribution of (up , T ( z ) t 4™' > )7 1 < .P, <Z < ^- Choos- 
ing n sufficiently large, we can make 

v((4"\T(z)4"))~( Up ,T(zK)), l< P ,q<l, (5.7) 

and 

v((u("),T Af (z) u ("))-( u W,T(z) JvU W)), l<p,<?<;, (5.8) 

arbitrary small uniformly in N > n. Indeed, the variance in (|5.7I) is bounded by 
0(||uj) — tip"''* || 2 + \\u q — i4 || 2 ) since the entries of T(z) are i.i.d random variables 
with bounded variance on the diagonal and i.i.d. random variables with bounded 
variance off the diagonal. In addition, 

(uW,T N (z)uW) - (u^\r N (z)u^) = («(») - u W,T N (z)uW) 

+ «),T JV (z)«)-<))) 

and we can use the bounds (| 1 . 33[) and (|1.34[) in Theorem 11.61 rewritten as 

V((u,/(Ajv)«)) < constx c —— , (5.9) 

E(uJ(X N )v) - (u,v) j f(x)d l i sc (dx)\ < cons^ || / 1| C?(R) MM (5.io) 



-2<r VN 

to show that 

V (<«W - <',T,(z)«W)) , V («), T* (*)(«<"> - <)))) 
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are arbitrary small (uniformly in N) provided one chooses n sufficiently large. This 
finishes the proof. □ 

Theorem If .71 allows the following extension of Theorem If .31 

Remark 5.1. Let vP~\ . . . , %iS r ^ be a set of orthonormal vectors in l 2 (N) such that 

\\ U (P) - u { $\ = oiN- 1 ' 2 ), l<p<r, (5.11) 

where denotes the projection of onto the subspace spanned by the first N 
standard basis vectors e%, . . . , e^- Let Un be the N x r matrix whose columns are 
given by the vectors , . . . ,Un- Also denote by the r x r diagonal matrix 

6 = diag(9i, ... ,di,.. ., Oj -i, . . . , JO _i, #j 0+ i, . . . , 9j„+i, ■ ■ ■ dj, . . . , 6j). 

Finally, define 

A N = U N QU* N . 

The result of Theorem \1.3\ can be extended for such An, with the matrix Vj given 
by 

0f<i/ p \T(0,,>W), 1 < P ,q < kj. (5.12) 



6. Appendix 

The appendix contains several basic formulas used throughout the paper. 

First, we recall the decoupling formula from |25j . Let £ be a real- valued random 
variable with p + 2 finite moments, and be a function from C — > M with p + 1 
continuous and bounded derivatives. Then 

p 

K a +1 



mm) = E -^r E (^ (tt) (0) + (6.i) 



a. 

a=0 

where n a are the cumulants of £, |e| < Csup t |<^ p+1 '(t)|E(|£| p+2 ), and C depends 
only on p. If £ is a centered Gaussian random variable, the decoupling formula 
(|6.1[) becomes 

mm) = v(oew(o), (6-2) 

and can be immediately verified by integration by parts. 

Next, we write a basic resolvent identity. For any two Hermitian matrices X\ 
and X2 and non-real z we have: 

(zl - X 2 )- x = (zL - X^- 1 - (zL - X 1 y 1 {X 1 - X 2 ){zl - X 2 )-' (6.3) 

As a corollary of (|6.3|) . one has the following formulas. If X is a real symmetric 
matrix with resolvent R then 

— — = RkpRqi + RkqRpi , for p^g, (6.4) 

OApq 

av M — RkpRpi- (6.5) 
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In a similar way, if X is a Hermitian matrix then 
dRki 



dV\eX pq 



RkpRqi + RkqRpi, p 7^ q, 



■ i {RkpRqi — RkqRpi) , p =h q, 



d 3m X P q 

— — - HkpRpi- 
asi-pp 

Finally, we will use the following properties of the resolvent: 

= sst( z !sp(x )y ^ 

where by Sp(X) we denote the spectrum of a real symmetric (Hermitian) matrix 
X. The bound (|6.6[) implies 

\\R N {z)\\<rMz)\-\ (6.7) 

Therefore, all entries of the resolvent matrix are bounded by |3tn(z)| _1 . In a 
similar fashion, we have the following bound for the Stieltjes transform, g(z), of 
any probability measure: 

I^I^Jm^r 1 (6.8) 
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